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Abstract 

We study the efficiency of the incomplete enumeration algorithm for linear and 
branched polymers. There is a qualitative difference in the efficiency in these two 
cases. The average time to generate an independent sample of configuration of poly- 
mer with n monomers varies as n 2 for linear polymers for large n, but as exp(cn°) 
for branched (undirected and directed) polymers, where < a < 1. On the binary 
tree, our numerical studies for n of order 10 4 gives a = 0.333 ± 0.005. We argue 
that a = 1/3 exactly in this case. 

Monte-Carlo (MC) simulations are a very important tool for studying polymers, as 
exact results are hard to come by, and are available only for the simplest models. Broadly 
speaking, MC algorithms fall in two classes [1]: the Metropolis type and the genetic type. 
The Metropolis type algorithms generate a time sequence of configurations of the polymer 
using a Markovian evolution. The transition probabilities from one configuration to the 
next are so chosen that the time average of properties of the system are equal to that from 
the desired distribution. These may use local moves as in Rouse dynamics [2], bi-local 
moves as in the reptation algorithm [3] or nonlocal moves as in the pivot [4] and cut-and- 
paste [5] algorithms. There is inevitably some correlation between different configurations 
generated in an evolution. These algorithms become inefficient if the correlation time 
becomes very large, eg. when simulating polymers in a random medium. 

In the genetic algorithms, one randomly generates a small random number of config- 
urations in each run. The probability that a given configuration is obtained in a run is 
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proportional to the desired distribution. One repeats the process for many runs to get 
a large sample. Examples of this type are the enrichment [6] and the pruned-enriched 
Rosenbluth method(PERM)-like [7] algorithms. 

While there have been many studies of linear polymers using various Monte-Carlo 
techniques like pivot [4, 8, 9], PERM [7], Berreti-Sokal algorithms [10], branched poly- 
mers have been less studied. Algorithms used for simulating linear polymers can often be 
adapted for branched polymers, but they are usually found to be less efficient. For exam- 
ple, in the pivot algorithm, the acceptance probability of the transformed configuration is 
found to be much less for branched polymers than for linear polymers [11]. The algorithm 
does not perform well for branched polymers adsorbed on a surface[12]. The PERM al- 
gorithm also seems to work less well for branched polymers than for linear polymers [13]. 
Incomplete enumeration(IE) is an algorithm belonging to the genetic class of algorithms. 
It has been used for simulating linear polymers[14], and for branched polymers [15, 16]. 

A better understanding of the efficiency of Monte Carlo algorithms for generating 
branched polymers seems desirable. We will study IE for linear and branched polymers in 
this paper. We choose the average computer time T n needed to generate one statistically 
independent sample of desired size n as a reasonable measure of efficiency of the algorithm. 
The dependence of T n on n is very different for IE for linear and branched polymers. 
We find for the linear polymers T n ~ kn 2 , but for branched polymers T n ~ exp(kn a ), 

< a < 1. We also discuss an improvement of IE which we call improved incomplete 
enumeration (HE), in this paper. We find that the improvement does not change the 
asymptotic dependence of T n on n in general. HE works better than IE but the difference 
is only in the coefficient k. 

The plan of the paper is as follows. We describe the IE algorithm in Section 1. In 
Section 2 we discuss the efficiency criterion for MC algorithms in general, and for IE in 
particular. In Section 3 we study the efficiency of IE analytically for some simple cases 
where the genealogical tree has a simple recursive structure. We also study IE for self 
avoiding walks(SAW) in this section. In all cases we find that T n ~ n 2 . In Section 4 we 
propose an improved version of the IE algorithm, HE. For simple random walks T n = n 
for the HE algorithm as compared to T n ~ n 2 for IE. For SAWs, HE is significantly more 
efficient and becomes better in higher dimensions, but asymptotic efficiency remains the 
same and T n ~ a^r? in all dimensions, though the coefficient ad decreases with increasing 
dimension. In Section 5 we study IE for branched polymers or lattice animals on a binary 
tree. We give heuristic arguments and numerical evidence to show that T n ~ exp(/cn 1//3 ) 
for large n for branched polymers on a binary tree. We also study IE and HE numerically 
for undirected and directed branched polymers on a square lattice in this section. We find 
that in both cases T n ~ exp(kn a ), < a < 1. We summarise our results in Section 6. 

1 The Incomplete Enumeration Algorithm 

Self-avoiding walk and lattice animals(LA) are simple lattice models of linear and branched 
polymers in dilute solutions. In order to study the thermodynamic properties of these 
polymers, one has to average over all allowed configurations of the polymer of a given 
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number of monomers. The averages are defined with all configurations considered to be 
equally likely. Since the total number of possible configurations grow exponentially fast 
with size of the polymer, brute-force exact calculation is possible only for small polymers. 
Monte-Carlo methods allows us to study much larger sizes by obtaining a representative 
sample of the set of configurations and estimate the ensemble averages from the sample 
average. 

The IE algorithm is a simple modification of exact enumeration algorithm for generat- 
ing polymers. A good exact enumeration algorithm generates all possible configurations 
exactly once [17]. This is ensured by defining a rule which, given an n-site configuration 
of a polymer, identifies uniquely one of these sites as the 'last added site'. Removing this 
site must result in an allowed polymer configuration of (n — 1) sites. The (n — l)-site 
polymer is called the parent of the n-site configuration. We start by imagining that we 
have arranged all configurations in a genealogical tree, whose nodes are the different con- 
figurations of the polymer, such that all polymer configurations of n sites are at level n 
and are connected to their parent at level (n — 1). Clearly, the tree depends on the rule 
used to define parenthood. For example, Fig. 1 shows a genealogical tree for directed 
lattice animals on a square lattice for n < 4, using one such choice (see Appendix for 
details). In the actual implementation of the algorithm, the whole genealogical tree is not 
constructed first, and tree is constructed and the pruning is decided as we proceed in a 
depth first search. 

As the number of configurations of polymer of size n increases exponentially with n, 
the time required to construct the genealogical tree up-to level n in the exact enumeration 
algorithm increases exponentially with n. The basic idea of the IE algorithm is to decrease 
this time by randomly pruning the genealogical tree. 

In IE we choose a set of (n — 1) real numbers pi (0 < < 1), for i = 1 to (n — 1). Any 
bond in the genealogical tree connecting level r to level (r + 1) is removed with probability 
(l—p r ) independent of the other bonds. If a configuration gets disconnected from the root 
node, automatically all its descendants are also removed. We make a depth first search of 
the pruned genealogical tree up-to depth n to determine the different configurations that 
remain at level n. We run the algorithm several times to generate a large sample. The 
probability of enumeration of a particular r site configuration in a given run is 

1=7 1 

s r = n Pi 

i=i 

This is same for all configurations of size r. This ensures that the sample of con- 
figurations obtained is unbiased. As a configuration can occur at most once in a single 
Monte-Carlo run, IE samples the population without replacement. 

The different runs are mutually uncorrelated. However, the number of configurations 
produced within one run varies from run to run, and different configurations produced in 
the same run are correlated. Also, the fraction of runs in which one generates at least one 
configuration of size n goes down with increasing n. 

In case of SAWs which model linear polymers, there is a natural labelling scheme in 
which one just labels the first point of walk by 1, the second by 2 and so on. In case of 



3 



branched polymers there are several different choices of labelling possible corresponding 
to different possible rules of removing a site from a n-site cluster to generate a (n — 1)- 
site connected cluster. We have used the Martin's labelling scheme [17] for our cluster 
counting algorithms. A brief description of this can be found in the Appendix. 



2 Efficiency 

In general, in Monte-Carlo methods, the time needed to estimate an ensemble average 
H — (O) of some observable O over all clusters of size n averaged over N independent 
samples would give estimate as /i* = /i ± a/\^N, where a 2 is the variance of O. If 
correlations are present, the average time required to estimate \x within the fractional 
error e varies as ((j/e/i) 2 r, where r is a measure of correlations in the data. For Metropolis 
evolution, r is the auto-correlation time of the observable O. In the case of IE, the 
efficiency depends on the average time taken by the Monte Carlo algorithm to generate a 
single run and the degree of correlations present in the different samples produced in the 
same run. 

It is difficult to determine the latter exactly for IE. It depends also on the quantity we 
want to average. Consider a set of configurations generated by N independent runs of the 
IE algorithm. Let the probability that a single run produces at least one sample be P(n), 
and the average number of configurations produced per run be a. Then for large N, we will 
generate approximately Na configurations, which will be made of approximately P(n)N 
mutually uncorrelated groups. Thus the average size of a correlated group is a/P(n). It 
seems reasonable to measure the efficiency of the algorithm in terms of the average CPU 
time required to produce one independent group of configurations. This overestimates 
correlations as this treats all samples produced within one run as fully correlated 1 . 

Other definitions of efficiency are possible, and may be advantageous in specific con- 
texts. For example, one may be interested in some asymptotic properties of the polymer 
problem, like the branching number A, or the critical exponent 9. In this case, the value of 
n is not decided beforehand, and the desired estimate is obtained by suitable extrapolation 
of data for different n. We can study average number of descendants < X n >m \{l — 9/n) 
to estimate A and 9. Analysis of errors in such quantities is more complicated, and will 
not be discussed here. 

Let T n be the average CPU time required to obtain one run which generates at least 
one configuration of size n. If r n is the average CPU time for one Monte-Carlo run, then 
we have 

1 For example, the mean is 2749.2 and the standard deviation a is 1349.2 for the radius of gyration of 
animals of size 50 for the full population. The average number of samples produced per successful run of 
MC simulation was 27.5. If we calculate the average radius of gyration of 100 samples of 10 4 consecutive 
runs, the standard deviation a , of the mean calculated is 144.2. This would have been cr/100 if they were 
uncorrelated. Similarly, for SAW of size 50 on a square lattice the average number of samples produced 
per successful run is 5.3 and a = 60.44, and a = 1.4 for 100 samples of 10 4 consecutive runs. 
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The average CPU time required for one run is estimated easily in terms of the time 
taken to add or delete a configuration on the genealogical tree. We define this to be one 
unit of CPU time. 

The total CPU time for one MC run is proportional to the number of nodes in the 
pruned genealogical tree. Let Xj denote the random number of j site configurations 
generated in a single run. The time to visit sites of the randomly pruned tree up-to depth 
n is YJj=\ Xj. The CPU time in a run is then proportional to the number of nodes in the 
pruned tree. The average CPU time per run r n , would be equal to the sum of average 
values (Xj), averaged over all runs. 

n 

r n oc < x j > (3) 

3=1 

For linear and branched polymers, the total number of configurations A n of a given 
size n is known to vary as 

A n ~ A\ n n- e (4) 

for large n. Here A is a constant, A is called the growth constant and 9 is a critical 
exponent. Since each configuration with n sites has a probability S n (Eq. (1)) of being 
generated, and there are A n total number of configurations, (X n ) = zinA n , giving 

n 

T n = X! A?'".?' (5) 
J'=l 

Since (X n ) can be directly estimated in IE, we get a way to estimate the number of 
configurations (X n ) by simulations. This can be used to estimate the A and 6. 

A study of the efficiency of the algorithm is complicated as P{n) depends on the 
structure of the genealogical tree, and is difficult to determine theoretically. 

An upper bound on working of these algorithm is the time for exact enumeration of 
all the samples, which is exponential in n. Consider the case in which Pi = p for all i. So 
long as pX > 1, (X n ) will grow exponentially with n. As P(n) < 1, this implies that T n 
increases exponentially with n if p\ > 1. Also, if p\ < 1, then P{n) varies as {p\) n to 
leading order, but r n remains finite (r n > t\) [18]. Thus again T n increases exponentially 
with n. These two considerations together imply that a good choice of p is that it should 
be approximately equal to 1/A. However, finding the optimal choice of {p^} for a given 
problem is non trivial. We investigate this in the next section for some illustrative cases. 

3 Optimising the IE algorithm 

3.1 Systems with Uniform genealogical tree 

The simplest of enumeration problems is the enumeration on a uniform genealogical tree. 
For example, random walks which are models for linear polymers without self exclusion 
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correspond to a uniform genealogical tree of branching number A. The number of nodes 
at level n is A™" 1 . 

Consider a uniform genealogical tree with two descendants per node. In this case 
number of nodes at level n would be 2 n_1 . For the choice of {pi}, the probability of 
connection of root with level r, denoted by P(r) follows a simple recursion relation 

P(r + 1) = 2p r P(r) - p 2 r P 2 (r) (6) 
with P(l) = 1. The average CPU time per run r„ is given by 

n 

r n = l + £2 i - 1 S i (7) 

i=2 

First we try to find out as to what choice of p^s minimises T n for small n. 
For small sizes one can try systematic optimisation. Let us choose n — 2. Then on 
the binary tree, P(2) = 2pi — pi 2 and r 2 = 1 + 2p±. This gives 



= (8) 



2pi + l 
2pi - pi 2 

Minimising with respect to pi, we get the minimum value ofT 2 tobe(3+ v / 5)/2 « 2.618 
for pi = (-s/5- l)/2 w 0.618. 

Similarly, the time (T 3 ) of IE for reaching level 3 from level 1, is given by 

3 2p l (2p 2 -p 2 )-p 2 (2p 2 -p 2 ) 2 W 

It is easy to check that T 3 in this case takes its minimum value for pi = 0.534 and 
p 2 = 0.618. Similarly for n — 4, the minimum occurs at p\ = 0.516, p 2 = 0.534 and 
P3 = 0.618. For large n, the best choice of pi tends to 1/2. By optimising till n = 30, 
we find that the best choice of pi is quite well described by the approximate formula 
Pl w |(1 + 0.5/(n-i) 2 ). 

For large r, if p r — ► p*, Eq. (6) can be approximated by P(r + 1) = 2p*P(r) —p* 2 P(r) 2 . 
For 2p* < 1, we get P(r) — > (2p*) r decreases exponentially with r. For (2p*) > 1, it leads 
to P*(oo) ~ (2p* - 1). 

We have already argued that pi should be close to 1/A, else the algorithm is inefficient, 
T n varies as exp(n). Consider now the case where pi = j(l + a/i m ), where a and m are 
parameters that we can vary to find the optimal values. In this case, (X n ) = + Q: / i m ) , 
and P(n) is approximately given by 

Then , if m > 1, we see that (X n ) tends to a constant for large n, and r n is proportional 
to n. Also, P(n) varies as 1/n, and we have T n ~ n 2 . 

If m = 1, and — 1 < a < 1, then varies as n a , and hence r n ~ n a+1 . Also, Eq. 
(10) gives P(n) ~ A(l — a)n a ~~ l . Interestingly, in the T n , these powers cancel and we get 
T n = T n /P(n) ~ C a n 2 . We find that C a ~ 1/(1 — a), hence the best choice of a is a = 0. 
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If m < 1, then (X n ) varies as exp(n 1_m ), and P(n) varies as n~ m , and hence T n varies 
as exp(n 1 ~ m ) to leading order, thus in this case m < 1 leads to a suboptimal performance 
of the algorithm. 

On a binary tree for p t = |, we get T n = n 2 /4. From systematic optimisation we 
saw that there exist a nontrivial optimal value for each pi which depends on the depth 
of the genealogical tree to be reached. This value for uniform binary tree was pi ~ 
|(1 + 0.5/(n — i) 2 ). But even with this choice for large n we get T n xs n 2 /A. This result 
is generalised straight forwardly to /c-node uniform tree. For the choice, pi = 1/kWi, we 



3.2 Systems with recursively defined genealogical tree 

It is necessary to check how non-uniformity of trees can change the above conclusions. The 
simplest of non-uniform trees are the recursively defined trees. The number of branches 
from a given node still follow a definite pattern which repeats and depends on the coor- 
dination number of the parent node. We consider some examples 

A node with k descendants will be called a A;-node. Consider a tree specified by the 
rule that the descendants of a 2-node are a 2-node and a 3-node, and the descendants of a 
3-node are one 2-node and two 3-nodes. We specify such a tree by the notation (23, 233) 
tree (Fig.l). If B 2 (n) and B 3 {n) are respectively the number of nodes at level n — 1 which 
have 2 and 3 descendants respectively, then 



B 2 (n) = B 2 (n-l) + B 3 (n - 1) (11) 
B 3 (n)=B 2 (n-l) + 2B 3 (n-l) (12) 

From these linear recursion equations it is easy to see that B 2 (n), B 3 (n) and also the 
total number of nodes at depth n, A n , all grow as (A)™ for large n, where A = (3 + \^E)/2). 

We now look at the efficiency of IE on this tree. Take all pi = p. We define P 2 (r) 
and P 3 (r) as the probabilities that a 2-node and a 3-node respectively are connected to 
at least one node r levels below. Clearly they have the following recursions 

1 - P 2 {r + 1) = (1 - pP 2 {r)){\ - pP 3 {r)) (13) 
1 - P 3 (r + 1) = (1 - pP 2 (r))(l - pP 3 {r) f (14) 

with P 2 (l) = P 3 (l) = 1. 

For large r, near the fixed point we get P 2 (r) j^P^r). Substituting in the second 
equation, we find the linear term vanishes for p — 1/A and the difference equation can be 
approximated by dP 2 /dr ~ —P 2 , which implies that P 2 (n) and P 3 (n) decay as 1/n for 
large n. We get P 2 (n) « tj^- The total CPU time at p = 1/A is (5+ ^ )w . It gives the 

upper bound on time per independent run to be ^t^z^t^ n 2 ~ 0.382n 2 . 

We can similarly analyse the other recursively defined trees. Consider for example, 
the tree given by the rule (23,223). We find that growth constant A is 2.4142 and for 
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Pi = 1/A for IE this gives T n « 0.396n 2 . On a (33,233) with growth constant 2.732 for 
Pi = 1/A for IE this gives T n = n 2 0.35n 2 . It is easy to convince oneself that for 

all recursively defined trees we get T n ~ n 2 . 

It is instructive to see the results of systematic optimisation over {p^} in case of 
non uniform trees. Similar analysis for (23,233) tree (Fig. 1) between level 1 and 2 
gives pi = 0.618. Similarly optimising T 3 between level 1 and 3 gives p\ = 0.562 and 
p 2 = 0.484. An optimisation between level 1 and 4 gives the best values of p\s to be 
Pi = 0.562, p 2 = 0.42 and p 3 = 0.467. We see that the optimal value of pi in this case 
depends on n. By optimising till n = 30, where n is the depth of the genealogical tree, 
we find that for tree levels away from root and bottom, optimal value of pi approaches 
1/A with increasing % and the asymptotic behaviour of algorithm remains the same as 
long as we choose Pi ~ 1/A. The optimal Pi values as a function of i are plotted in Fig.2. 
The optimising value of Pi are a bit higher than 1/A near the two ends i — 1 and i = n. 
This extra optimisation does not change the T n ~ Kn 2 dependence, and infact does not 
change the asymptotic value of K either. 

The incomplete enumeration algorithm generates a bond percolation process on the 
genealogical tree, where each link is present independently with a probability p. We define 
the percolation threshold p c on the tree to be such that for all p > p c , there is a non zero 
probability that the starting node belongs to an infinite cluster. For p < p c the probability 
of connection between root and level n usually goes down exponentially in n. At p c it is 
expected to decrease as a power law in n and for p > p c it takes a finite value in the limit 
of n — > oo. The p c on a tree is bounded from below by 1/A [19]. For the genealogical trees 
which we discussed, the p c was equal to 1/A and the optimal behaviour of the algorithm 
was achieved for p^ ss 1/A = p c . 

3.3 Self avoiding Walks 

We now consider IE for SAW. For a SAW on a d dimensional lattice, the number of 
configurations A n ~ A n n 7_1 , where A is a lattice dependent constant and 7 depends only 
on the dimension. The exponent 7 is known to be 1 for d > 4, and 7 = 43/32 for d = 2 [20]. 
The exact value of A is known for the hexagonal lattice [21], and fairly precise numerical 
estimate, which matches well with root of a quartic equation with integer coefficients is 
known on the square lattice [22]. 

The genealogical tree for SAW is not uniform. For example, for rooted SAW(one end 
fixed at origin) on a square lattice, the number of different allowed choices of the n th 
step for n > 1 varies from to 3, depending on the walk. In this case it is difficult to 
determine the probabilities of connection up-to level n analytically but we have estimated 
P(n) numerically by simulations. We choose Pi = A _1 (l + 1/i) 1 ^ 7 , so that on the average 
we get order one configurations of size n per run for large n. With this choice of pi our 
numerical simulations show that the probability of reaching level n goes down as 1/n and 
hence whenever level n is reached, on an average ~ n SAWs of size n are generated. This 
also implies that p c is indeed 1/A on the SAW genealogical tree. We did 10 6 Monte-Carlo 
simulations and generated walks up-to size 10, 000 on a square lattice. We have plotted 
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T n in Fig 3. Our numerical fit suggests T n for IE to be (0.42 ± 0.01)n 2 . 

In 3 dimensions A = 4.6839 and 7 = 1.16 [20] and nearly 90% nodes have coordination 
number 5. Hence the tree is more uniform than the 2d case and we get T n m 0.43n 2 (Fig. 
3). 

The genealogical tree becomes more and more uniform as we go to higher dimensions. 
In general on a d dimensional hyper-cubic lattice the maximum branching possible is 
2d — 1 and in the limit d — > 00 the growth constant has an expansion [20] 

x - 2i - l -T d -jk~ (15) 

Hence the dominant branching is 2d — 1 and probability of a node branching into 2d— 1 
branches increases with dimension, and the lower branching numbers occur with much 
smaller frequencies. The probability of connection to level n is hard to obtain analytically 
for any d. 

In Fig. 3 we have also shown a plot of efficiency of IE in 3 and 4 dimensions for SAW. 
In few hours one can simulate 10 5 Monte-Carlo runs for walks of size 1000 on a Pentium-4 
machine. We get T n ~ n 2 for 2,3 and 4 dimensions. This leads us to conclude that the 
small non uniformity of the genealogical tree is unimportant and T n varies as n 2 in all 
dimensions for SAW. 

We note that for SAWs, other algorithms like pivot are known to be more efficient. 
For pivot algorithm the correlation time for end to end length varies as n x with x < 1 in 
two dimensions [9]. However, if we want to study some variable like correlations in the 
directions of consecutive steps of the walk, the correlation time will have to satisfy the 
inequality, T n > n, as one would need to update each step about 0(1) times to affect the 
nearest neighbour correlations. 



4 Improved Incomplete Enumeration (HE) 

The main limitation of IE is attrition: the probability of generating n-site configurations 
in a given Monte-Carlo run goes down with n. One way to increase the probability of 
survival is to redistribute weight amongst the descendants in such a way that while the 
probability that a particular node is selected remains same as before, the probability that 
at-least one of the descendants is chosen is increased. We call this 'Improved Incomplete 
Enumeration(IIE)'. 

Suppose in the implementation of IE as outlined in Section 1, we come to a node 
with degree j. Then in IE, each link is independently deleted with a probability (1 — 
p), and the probability that all links are deleted is (1 — pY , which is non zero, even if 
the expected number of descendants of this node is pj > 1. In HE, the links are not 
deleted independently. The probability that any given node is selected remain p, but the 
probability that at least one node is selected increases. This is implemented as follow: If 
there are j descendants of a node and each link downward is present with probability p, 
then we choose Int(pj) edges at random and give them weight one, and select one of the 
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edge out of the remaining j at random and give it a weight one with probability frac(pj) 
and delete all the other edges. 

Hence we see that in HE, though the average probability of selection of an edge remains 
p, but it enhances the probability of connection between two level of the genealogical tree 
and hence the probability of success in a given Monte Carlo run. For example, as will be 
discussed in the next section, on a regular tree with p — 1/A, the probability of connection 
up-to n levels below in HE is exactly one whereas it goes as 1/n in IE. 

4.1 Systems with recursively defined genealogical tree 

In HE one redistributes the sum of probabilities of connection from a node to the next 
level. On a uniform binary tree iji = 2 \/i and with pi = 1/2, yiPi = 1 and hence for 
Pi = 1/2 with HE probability of reaching any level n of the tree after n steps is exactly 
1 and exactly one configuration of any given size is generated in the process and hence 
T n = n. With pi = 1/k this result holds for any k node uniform tree. Clearly pi = 1/k is 
the best choice in this case, as an absolute lower bound on time T n of the algorithm is n. 

If we use the improved algorithm for a (23,233) tree, (X n ) and hence the average CPU 
time per run will remain the same. We can also determine the connection probabilities 
P2(n) and P 3 {n). The coupled difference equations for P2(r) and P 3 (r) have no cubic 
term. The recursions are 

P 2 (r + 1) = p(P 2 (r) + P 3 (r)) (16) 
3o — 1 

P 3 (r + 1) = p(P 2 (r) + 2P 3 (r)) - Ji—(2P 2 (r)P 3 (r) + P 3 2 (r)) (17) 

which at p = 1/A = p c gives P2(n) varying as 1/n for large n. The time per independent 
run comes out to be A ( 3 ~ A ) ps | times that in incomplete enumeration. That is, HE is 
nearly three times more efficient than IE. 

HE certainly works better than IE. But, except for the uniform tree, the difference 
between IE and HE is only in the coefficient of n 2 . While performance of HE improves as 
the genealogical tree becomes more and more uniform, there is no qualitative difference 
in the efficiency of IE and HE on a recursively defined non uniform tree. 

4.2 HE for SAW 

We studied HE on a d dimensional hyper-cubic lattice for d — 2 to 10. 

HE enhances the performance of the algorithm by increasing the probability of connec- 
tion between root and level n. For SAW on a square lattice, Fig. 4 shows the probability 
of connection P(n) for IE and HE both. P(n) is roughly 3.5 times bigger for HE. In two 
dimensions, T n is of order 0.12n 2 for HE. In three dimensions the performance is even 
better and T n m 0.056n 2 , which is roughly a factor of 7.5 less than the time taken by IE. 

In general we find on a d dimensional hypercube HE has a efficiency T n = ddii 2 where 
a d is a decreasing function of dimension for generating SAWs. Fig. 5 shows the plot of HE 
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for dimensions 2 to 10. The memory requirement of the algorithm just increases linearly 
with system size in all dimensions and we could perform 10 5 MC runs for walks up-to sizes 
1000 in few hours of computer time on a Pentium-4 machine. We find that decreases 
as d~ 2 approximately, i.e the algorithm performs better with increasing dimension. 

We conclude that for IE and HE for SAW, T n = a^r? . The probability of connection 
between root and level n does not depend on 7. It depends only on the non- uniformity of 
the tree. The genealogical tree is more uniform in higher dimensions and the constant aa 
depends on dimension. For IE, the change in a<2 with dimension is quite insignificant. But 
ad can be decreased significantly by redistributing weights. This is a strong numerical 
evidence that the performance is always 0(n 2 ) independent of the dimension and 7 for 
linear polymers. 

A further enhancement can be achieved by choosing the pruning only after looking 
deeper, but we found that because of the increase both in the memory requirement and 
in the CPU time to generate one configuration, there is no net gain over HE. 

5 Lattice Animals and Branched Polymers 

In this section we will study the IE algorithm for branched polymers. Since the efficiency 
of IE is polynomial in n for linear polymers, it seems plausible that it will be so also 
for branched polymers. There are two important ways in which the genealogical tree 
for branched polymers differ from that for linear polymers. There are several equally 
reasonable, computationally easy to implement choices of rules to define parentage, and 
in all of them the degree of a node is not bounded. The number of possible descendants 
of a node is of the order of its perimeter sites and hence the maximum of the degree of 
nodes at level n increases linearly with n. The average number of descendants A is of 
0(1), and the number of nodes with large branching number is exponentially small. But 
this makes an important difference in the fluctuations of the number of animals of a given 
size generated in a given run. 

The structure of genealogical tree for lattice animals is more complex than for self- 
avoiding walks. We studied the algorithm on genealogical tree obtained by using Martin's 
labelling scheme [17]. We have tried two or three variations of the priority rules, and our 
results are insensitive to these changes. 

5.1 Lattice animals on a Binary tree 

We first discuss our results for the animals on a binary tree. This simple case is more 
analytically tractable. The generating function of total number of lattice animals on a 
binary tree is well known [19] and it is A(y) = A r y r = (1 — y/1 — Ay)/2y, where A r is 
the total number of animals with r sites. A r are the Catalan numbers, which come up in 
many other contexts in combinatorics [23]. For large r this gives A r ~ 4 r r~2. The growth 
constant A in this case is 4. 

The number of descendants of a node at level r in the genealogical tree for this problem 
lies between 2 to (r + 1). In this case the genealogical tree is easily characterised: The root 
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site is a 2— node. A /c-node has k descendants, and the degree of these descendants are 
k + l,k, ....3, 2 respectively. This is seen as follows: the node corresponds to a branched 
polymer with k unblocked perimeter sites, which are ordered by some priority rule. The 
m th descendant of this node is a node of degree (k + 2 — m) and corresponds to first (m — 1) 
perimeter sites blocked, m th site occupied and {k — m) allowed for further occupation. 
Since on a binary tree every site has two downward neighbours, hence we see that a /c-node 

will give rise to nodes with k + 1, k, 2 descendants. For example, in Fig. 6, the top 

node corresponds to an animal of one site, and has two growth sites. If first of these two 
sites is occupied, then the corresponding animal has three growth sites. If it is blocked it 
has two growth sites and so on. 

The total number of nodes at a level r is equal to A r . Let B r (k) is the number of 
/c-nodes at level (r — 1). We can determine the distribution of the branching number. We 
find that B r (k) satisfy the following relation 

fc-2 

B r (k) = A r _ 2 -J2B r - 1 (s) (18) 

s=2 

As r — > oo, 1/4 of the nodes at a level have 2 offsprings and 1/4 of the total nodes 
have 3 offsprings. And level r has exactly one node with degree (r + 1). For k > 4, it can 
be shown that in the asymptotic limit (r — > oo), the fraction of nodes having k offsprings 
is (k - l)/2 fc for r >> k. 

To find the efficiency factor T n , we have to determine the probability of connection of 
root to a level. If P(k, r) is the probability of a node with k offsprings to be connected 
to at-least one node r levels below it, then P(k,r) has a recursion 

fc+i 

P(k,r + 1) = 1 - -pP(s,r)) , k = 2 to oo (19) 

s=2 

with initial conditions 

P(k,l) = IV k > 2 (20) 

and p is the probability with which we choose any edge of the tree. P(2,r) will give the 
probability of connection of root to level r on the genealogical tree. Eq. (19) is a nonlinear 
equation. This equation can also be written as 

l-P(k,r) = (l-P(k-l,r))(l-pP(k + l,r-l)) k > 2 (21) 

This equation is also valid for k = 2 if we choose the convention that P(l,r) = 
pP(2,r- 1). 

These equations have the following properties: 

1. For p < 1/4, P(k,r) tends to zero as r tends to infinity exponentially fast for any 
fixed k. In fact, if we consider r as a time like variable and k as space like variable, 
then P(k,r) has a travelling front solution in this regime (P(/c,r) = F{k — vr)). 
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2. For p — 1/4, the velocity of travelling front goes to zero. The distance moved by the 
front increases as r 1 / 3 and P(k,r) ~ F(k — r 1 / 3 ). As F(x) ~ exp(x) for x — > — oo, 
this implies that P(2, r) ~ exp(— cr 1//3 ) for large x. 

3. For p > 1/4, as r goes to infinity, P(k) tends to a non trivial fixed point function 
P*(k) greater than zero. 

This may be seen as follows. The fixed point equation in terms of fixed point variables 
P*(k) is 

1 - p*(k) = {1-P*{k- 1))(1 - P P*{k + 1)) (22) 

Clearly, P*(k) = V/c is a trivial fixed point of this equation. For p > 1/4, there is a 
non trivial fixed point with P*(k) non zero monotonic increasing, with P*(k) pa 1 — (1— p) k 
for large k. However, a closed form solution for any p > 1/4 is difficult. 

On numerically iterating Eq. (19) in r, we find that the equation has a travelling front 
solutions for p < 1/4 and has nontrivial fixed point for p > 1/4. 

Eq (22) has two stationary solutions, i.e P*(k) = Vik and P*(k) = OVA;. For p < 1/4, 
P*(k) = is the stable solution while P*(k) = 1 is an unstable solution. Our initial 
conditions given by Eq (20) are steep. Starting with these initial conditions, on numerical 
iteration we find that as r increases, a front separating stable solution P = and unstable 
solution P = 1 moves in the forward direction. From the translational invariance of Eq. 
(19) one expects a running wave solution. We find that the front moves with a constant 
velocity and hence, P(k, r) for large k and v must tend to the asymptotic form 

P(k,r) ~ F(k-vr) (23) 
We define k*(r), the width of the front by the equation, 

P(F(r),r) = i (24) 

Fig. 7 shows a plot of numerically determined P(k,r) with respect to k — k*(r) for p 
near 1/4. Curves for p below, above and at p = 1/4 all collapse on the same line. Actually, 
a travelling front for P(k,r) as defined by Eq. (21) exists for all k, — oo < k < oo, if we 
take boundary conditions such that P{— oo, r) = and P(oo,r) = 1. 

At p — 1/4, the velocity of the travelling front is zero. If we plot P(k + l,r) as a 
function of P(k, r), we find that as r increases the graph approaches a limiting form. Thus 
for the asymptotic wavefront, P(k + l,r) is a single valued nonlinear function of P(k,r). 
We have plotted these values for different r in Fig. 8 and they all are very close and 
seem to lie on the same curve. Hence if we start from a point on this curve and iterate 
the fixed point equation Eq. (22) with p — 1/4, we generate a travelling front. We have 
not been able to deduce the functional form of this function, which corresponds to a first 
order difference equation for P*(k) from the second order equation Eq. (22). Eq. (22) 
turns out to be a stiff equation and one has to be careful while iterating it in increasing 
k direction. We iterated Eq. (22) starting with different sets of values of P*(k + 1) and 
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P*(k) given by Fig. 8 and found the equation yields a travelling front same as the one 
shown in Fig. 7. 

We could not solve the full non-linear difference equation Eq. (19). Keeping only the 
terms linear in P will give an upper-bound on P(k,r + 1), i.e 



k+l 



P(k,r + 1) <pJ2 p ( s > 



(25) 



s=2 



We can represent this set of equations in matrix form also. Hence if V r represents the 
infinite column array with k th entry being P(k, r) then 



V r < p r M r V 



(26) 



where M is the transition matrix. If A m is the largest eigenvalue of M then for p < 1/A m , 
in the limit of r — > oo, V* will be 0, i.e P*(k) = for all k, and for p < 1/A m . 

The elements Mjj of the transition matrix M are such that, Mjj = 1 for j < (i + 1) 
and otherwise. If we truncate M beyond n x n (M n ), then the determinant D n of M n 
comes out to be 



D n = A(X) 



X l 



x 2 



(27) 



1/Jl — visa coefficient which does not depend 



with xi, X2 — ^ ± y 1 — |, and A(A) 
on n. Then equating D n = in the n — > oo limit gives \ rn = 4. This implies that for 
p < 1/4, P(k,r) will decay exponentially with increasing r and Eq (25) will work well. 
Hence, by definition percolation threshold p c of this tree is 1/4. 
The linearised recursion can be solved explicitly, and we get, 



P(k, r) = p r 



r[£; + 2r- 1] 
T[k + r- l]T[r + 1] 



which for large r gives 



P(k,r) 



1 



A\fivr 



exp 



In 2 [k + r 



ln(4p) ~ 



(28) 



(29) 



If we assume a travelling front solution of kind P(k, r) oc exp(A(/c — f r)) to be valid in 
the tail of the distribution, then substituting in linearised recursion (Eq.(25)), for a given 
p we get a spectrum of travelling wave like solutions parametrised by A with the velocity 
v of the front given by 



A 1 "" 



exp (—A) 
P 



- 1 



(30) 



In this case, it is known that the front actually chooses a unique velocity given by minimum 
of right hand side of Eq. (30) with respect to A [24] . The front velocity is given by 
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2exp(-A*) - 1 

1 - exp(-A*) v 1 

where A* is the solution of the transcendental equation 

zV-exp(-A-) exp(-A-) 

A* p 1 - exp(-A*) v 1 

Near p — 1/4, we can take v ln(4p)/ln(2) and A « ln2. Travelling front solutions 
have been found in a large variety of problems in physics [25]. 

The linearisation of Eq (19) would be valid only for p < 1/4 and k < k (r). Beyond 
that, linear solution will grow beyond one whereas the solution of the full nonlinear 
equation will saturate to 1. Here k (r) is the value of k at which P(k,r) given by Eq. 
(29) becomes of 0(1) and is equal to 

-rin(4p) 1 
k o(r) = — — , M p < - (33) 

At p = 1/4, the asymptotic velocity of the front is zero and the front advances as a 
sub linear power of r. This is the critical point of the percolation on this tree, and Eq. 
(29) gives a algebraic decaying solution for sufficiently small k. This is only an upper 
bound to the actual value. On numerically iterating Eq. (19) for r upto order 10 4 , we 
found unexpectedly that it decays as a stretched exponential in r. 

The fixed point equation as given by Eq. (22) is again a nonlinear equation. To 
find the dependence of probability of connection of root, P(2,r), on the width of the 
front we solved the linearised fixed point equation. On solving, we find that it goes 
as 2~ /c * (r ) for large r, where k*(r) is the width of the distribution. Hence in general, 
P(2,r) ~ exp(-afc*(r)). 

We further studied the width k*(r) of the front as a function of r for different values 
of p. At p = 1/4 we found k*(r) ~ rs. Fig. 9 shows a plot of k*(r) as a function of 
r 1//3 . For p = 1/4, the plot is a straight line. This implies that P(2,r) ~ exp(— cr 1 / 3 ) at 
p — 1/4. For p < 1/4, k*(r) varies linearly with r and tends to a constant for p > 1/4. 
We can directly iterate Eq. (19). In Fig 10 we have plotted —log(P(2,r)) as a function 
of r 1 / 3 which comes out to be a straight line. Fig 9 and Fig. 10 are strong numerical 
evidence that the probability of connection goes as exp(— cr a ) for branched polymers on 
binary tree. Our numerical studies give a = 0.333 ± 0.005 and c = 2.47 ± 0.01. 



5.2 Heuristic argument for the stretched exponential behaviour 

of P(n) at p = 1/4 

We now present a heuristic argument to understand why k* varies as r 1//3 at p c . Let us 
consider a genealogical tree of lattice animals on a binary tree, in which nodes with more 
than k descendants are deleted. We denote the probability that the maximum degree of 
a node connected to root down to level r is k m , by H 2 (k m ) and the probability that a k m 
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node is connected to at least one node r level down on the truncated genealogical tree by 
Jk m (r)- 

Now on a truncated tree, transition matrix M is no longer infinite. It is now a k m xk m 
matrix with Mjj = 1 for j < (i + 1) and otherwise. Here M it j represents the i th row 
and j th column entry of M, and we find the critical value of p which is just inverse of the 
largest eigenvalue of M to be a function of k m and is equal to 

p c (M = i(l+W(^l)) (34) 
For p < p c (k m ), Jk m ( r ) decays exponentially with r. In large r limit it is given by 

Jk m (r) ~ exp(rlog(p/p c (k m ))) (35) 

At p — 1/4, we get Jk m { r ) ~ exp(— br/k^J, where b is a constant. 
It is easy to get a lower bound on H 2 (k m ), as a order k m node occurs first time at level 
k m and probability of connection of root to this node is p km . Hence 

H 2 (k m ) > p k - = 4~ k ™ (36) 

Hence, since p = 1/4 is less than p c (k m ) for any finite k m , Jk m (j') ~ exp(— br/k^), 
where b = 7r 2 . Since H 2 {k m ) > exp(—a p k m ), for large r we get 

P(2, r) > max 

km 

which gives 

P[2,r] > exp{-cr 1/3 ) (38) 

where c = ^{2ba^)a. If we take H 2 (k m ) to be as given by Eq. (36), we get an lower 
bound on P(2,r). Taking b = ir 2 and a p = log4 we get c = §(27r 2 Zog 2 4) 1/3 = 5.04. This 
should be compared with the numerical estimate c = 2.47. 

Thus our numerical simulations and qualitative arguments show that probability of 
connection goes down as a stretched exponential at p — 1/4, the p c of the genealogical 
tree of lattice animals on binary tree as opposed to r _1 decay for linear polymers. So if 
we chose Pi = 1/4 Vi, then (X r ) ~ r -^l 2 an d hence the average computer time to generate 
one statistically independent sample of size r, T r would go as exp(cr x / 3 ) to leading order. 

Clearly the algorithm is not working well and one would like to enhance its efficiency 
if possible. We tried to study the algorithm by choosing pi such that its asymptotic value 
is 1/4. We chose Pi = + and studied T r as a function of x and m. 

As argued earlier, taking m = 1, we can change r r and P(r) by multiplicative factors 
which are powers of r. This will not make much of a difference, as the leading dependence 
remains exp(cr 1 / 3 ). Using m < 1, seems to be more interesting. 

For m < 1, the average CPU time per Monte-Carlo run would vary as exp(xr 1 ^ m ) . In 
case of linear polymers, we saw that time complexity of the algorithm for m = 1 for any x is 



exp I dpK m ^ 2 



(37) 
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polynomial in r. Hence, m < 1 was clearly a bad choice. But in the case of lattice animals, 
this increase in numerator is exactly cancelled by a corresponding increase in P(r). For 
2/3 < m < 1, T r increases as exp(a;r 1_m ) and P{r) varies as exp(— cr 1 / 3 + xr 1 ^' m ) to 
leading order for large r. These cancel to give T r ~ exp(cr 1,/3 ) independent of m. To 
monitor the behaviour of various prefactors, we study this numerically. Fig. 11 shows 
plot of T r for r = 1000, for m = 2/3, 5/6 and 1 as a function of x. For 1 > a > 2/3, to 
leading order T r goes as exp(cr 1 / 3 ), but there exist a non trivial value of x at which T r is 
minimum for a given m. If we look at T r at best value of x for m = 2/3, 5/6 and 1, we 
find that as r increases the difference is not significant. 

Hence we conclude that to leading order, T r . ~ exp(cr 1 / 3 ), for the best choice of p. For 
all 2/3 < m < 1, there exist a range of x for which the time complexity of the algorithm 
will remain qualitatively the same. 

5.3 Lattice Animals on a 2 dimensional square lattice 

We also studied the efficiency for lattice animals on a square lattice. From exact series 
enumeration the A r is known to vary as A r with A « 4.06257 [26]. In this case also the 
number of offsprings a node at level r can have is 0(r) and the genealogical tree in this 
case though more complicated, is qualitatively similar. Numerically, we find that the 
probability distribution of number of descendants k (of a randomly chosen node) has a 
maximum at k — 4, with Prob(k — 4) « 1/4. We enumerated lattice animals up-to sizes 
1000 using IE with 10 6 Monte-Carlo runs. It took time of order one day on a Pentium-4 
machine. With HE we generated samples of size 2000 with 2 x 10 6 Monte-Carlo runs in 
2-3 days time. These sizes are of same order as those produced using the cut and paste 
type algorithms. 

In this case, we find that P(r) has the stretched exponential form P(n) ~ exp(— cn a ), 
with a « 0.4 for both IE and HE. Fig. 12 shows [— logP(r)] varies approximately linearly 
with r 0A . We also studied the directed lattice animals (DA) on a square lattice. In this 
case we find that, a = 0.32 ± 0.02 (Fig. 13). 

6 Discussion 

We find the efficiency of IE to be different for linear and branched polymers. This is due 
to the fact that genealogical tree for the latter is much more non uniform. 

For self avoiding walks, in any dimension, the time to generate an independent sample 
of n steps T n ~ a^n 2 , independent of dimension for both IE and HE. For IE there is no 
significant change in with dimension. But for HE ~ dr 2 . In the limiting case of 
SAW on binary tree T n = n for HE. 

For branched polymers T n increases as exp(cn a ) with < a < 1 in all dimensions for 
both IE and HE. Redistributing weight does not change the value of a. HE works better 
than IE, but the difference is only in the coefficient c. The exponent a depends weakly 
on the dimension, its relation to the usually studied exponents of the branched polymer 
problem eg. 9, v is not clear at present. 
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As discussed earlier, the genealogical tree for cluster enumeration is not unique and 
one might argue that Martin's scheme is not the optimal choice. We tried to generate the 
genealogical tree using some variations of this rule, but we did not find any significant 
change in efficiency of the algorithm. 

For branched polymers, the degree of a node in the genealogical tree is not bounded, 
and the maximum degree increases with depth of the genealogical tree. However, the 
fractional number of nodes with high degree is very small. For genealogical tree corre- 
sponding to animals on a binary tree we find the fractional number of /c-nodes goes down 
exponentially with k for large k(Eq. (18)). Similar, behaviour was observed for branched 
polymers and directed branched polymers on a square lattice numerically. It is surprising 
that even an exponentially rare distribution of nodes with large degree seems to be enough 
to change the behaviour of efficiency of the algorithm on the tree. 

In the case of branched polymers, we found that the T n for IE varies as exp(cn a ) with 
< a < 1. While this is not very good, one can find problems for which IE's performance 
is even worse with a — 1. As an example, consider self avoiding walks on a disordered 
lattice, obtained by removing a fraction (1 — u) of bonds at random from a square lattice. 
It is known that the average number of self avoiding walks of length n varies as (u\) n [27], 
where A is the growth constant of the self avoiding walks on the same lattice with u — 1. 
Hence the growth constant of the corresponding genealogical tree would also be uX. Now 
if we consider a square lattice, the A ~ 2.638 and the bond percolation threshold is 1/2. 
For 1/A < u < 1/2, all clusters would be finite with probability 1, and the probability 
that cluster contains n sites would decrease exponentially with n. In this case, IE will be 
inefficient and even for best choice, T n will vary as exp(cn). 

One could argue that IE is a rather inefficient algorithm, which gives reasonable per- 
formance only for a small selected set of problems. We do not think so. In fact, the causes 
that make IE inefficient are also operative in the much larger class of genetic type algo- 
rithms. The high degree of correlations between different samples generated is a common 
feature of many of these algorithms which employ pruning and enrichment. For example, 
one could expect a similar behaviour to occur in the Berreti-Sokal algorithm [10], for 
branched polymers. The correlations arise because in all such 'evolutionary' type algo- 
rithms different samples generated often share a common ancestor in the past. Whether 
our results can be generalised to a larger class of PERM type algorithms seems to be an 
interesting question for further study. 
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7 Appendix 

As discussed in Sec.l, to enumerate all allowed configurations on a computer, one need 
a good exact enumeration algorithm which would generate all possible configurations 
exactly once, without needing to refer to what has been generated previously. Hence, 
one has to label the n-point configurations such that for any n-point configuration the 
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labelling is unique and on removing the last added site we must get an allowed (n — 1)- 
point configuration. For the self avoiding walks this can be easily achieved by labelling 
the first point of walk as 1, the second 2, and so on. But usually such natural choice of 
labelling doesn't exist for most problems. For lattice trees and animals, Martin discusses 
this in detail [17]. Here we describe briefly his algorithm for labelling a n-cluster. 

• Choose a rule for ordering the neighbours of any given site. For example, for DA 
on a square lattice (Fig.l ), we chose the rule that the upward neighbour is labelled 
before the right neighbour. For lattice animals on the binary tree we choose left 
neighbour before the right neighbour (Fig. 6). 

• We label the root as 1 and its neighbours are labelled 2, 3, 4.... in the order according 
to the priority rule. 

• When all points adjacent to point 1 have been labelled, label any still unlabelled 
points adjacent to point 2 according to the priority rule and then of point 3 and 
so on. This labeling hence induces a tree structure on the cluster which is the 
genealogical tree. 

The labelling described above is just one way of labelling the configurations. One can 
invent many other labelling schemes, which would give rise to different genealogical tree. 
But we find that the nature of genealogical tree depends on the underlying problem and 
not on the rules of labelling. 
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Figure Captions 

• Figurel:An example of a genealogical tree. The numbers labelling the sites indicate 
the order in which they are added (1 represents the root site). The tree shown is 
for directed lattice site animals on a square lattice. 

• Figure2:Plot of optimum values of pi on a (23, 233) tree of depth 30 

• Figure3: T n /n 2 of IE as a function of size n for SAW on a 2, 3 and 4 dimensional 
hyper cubic lattice. The lower most graph is for SAW on a square lattice and middle 
one in 3 — d and the topmost is for 4 — d. 

• Figure4: Probability of getting a walk of size n on a square lattice for IE and HE. 

• Figure5:T n /n 2 of HE Vs size n for SAW on a 2,3,4,5,6,7,8 and 10 dimensional 
hyper cubic lattice. 

• Figure6:First few levels of the genealogical tree for lattice animals on a binary tree. 
Solid circles represent the occupied sites and crossed circles denote blocked sites on 
the Bethe lattice. Dotted lines sketch the underlying Bethe lattice, whereas solid 
lines represent the bonds present. 

• Figure7:Plot of P{k, r) Vs scaled k - k*(r), for p = 0.25 and p = 0.25 ± 0.0001 and 
r = 100, 300 and 600. All the nine curves collapse to the same front profile. 

• Figure8:Plot of P(k + 1) as a function of P(k) at p = 1/4 for r = 25000, 26000, 28000 
and 30000. All the curves are very close and approach a limiting form with increasing 
r. The dotted line is just the line x = y. 

• Figure9:The width k*(r) of the travelling front as a function of r 1 / 3 for different 
values of p. The value of p increases from left to right. Curves of left of p = 1/4 are 
for p < 1/4 and the ones on right are for p > 1/4. For p — 1/4 the graph approaches 
a straight line as r — > oo. 
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FigurelO:Thick line is the plot of — log(P(2, r)) as a function of r 1 / 3 , when p is taken 
to be 1/4. The dotted line is a straight line of slope 2.47. 

Figurell:Plot of logT„ for m = 2/3, 5/6 and 1 as a function of x for n = 1000. 

Figurel2:Plot of —log(P(2,r)) Vs r 0A for lattice animals on a square lattice with 
IE and HE. 

Figurel3:Plot of — log(P(r)) Vs r 32 for directed animals with HE. 
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Figure 1: An example of a genealogical tree. The numbers labelling the sites indicate the 
order in which they are added (1 represents the root site). The tree shown is for directed 
lattice site animals on a square lattice. 
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Figure 2: Plot of optimum values of pi on a (23, 233) tree of depth 30 
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Figure 3: T n /n 2 of IE as a function of size n for SAW on a 2,3 and 4 dimensional hyper 
cubic lattice. The lower most graph is for SAW on a square lattice and middle one in 
3 — d and the topmost is for 4 — d. 
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Figure 5: T n /n 2 of HE Vs size n for SAW on a 2, 3, 4, 5, 6, 7, 8 and 10 dimensional hyper 
cubic lattice. 
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Figure 6: First few levels of the genealogical tree for lattice animals on a binary tree. Solid 
circles represent the occupied sites and crossed circles denote blocked sites on the Bethe 
lattice. Dotted lines sketch the underlying Bethe lattice, whereas solid lines represent the 
bonds present. 
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Figure 7: Plot of P(k,r) Vs scaled k - fc*(r), for p = 0.25 and p = 0.25 ± 0.0001 and 
r = 100, 300 and 600. All the nine curves collapse to the same front profile. 
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Figure 8: Plot of P(k + 1) as a function of P(k) at p = 1/4 for r = 25000, 26000, 28000 
and 30000. All the curves are very close and approach a limiting form with increasing r. 
The dotted line is just the line x = y. 
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Figure 9: The width k*(r) of the travelling front as a function of r 1 / 3 for different values 
of p. The value of p increases from left to right. Curves of left of p — 1/4 are for p < 1/4 
and the ones on right are for p > 1/4. For p — 1/4 the graph approaches a straight line 
as r — > oo. 
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Figure 10: Thick line is the plot of — log(P(2, r)) as a function of r 1 / 3 , when p is taken to 
be 1/4. The dotted line is a straight line of slope 2.47. 
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Figure 12: Plot of —log(P(2,r)) Vs r 0A for lattice animals on a square lattice with IE 
and HE 
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Figure 13: Plot of — log(P(r)) Vs r 32 for directed animals with HE. 
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